Abstract Severe acute respiratory syndrome coronavirus
(SARS-CoV) encodes a highly basic nucleocapsid (N) protein 
of 422 amino acids. Similar to other coronavirus N proteins, 
SARS-CoV N protein is predicted to be phosphorylated and 
may contain nuclear localization signals, serine/arginine-rich 
motif, RNA binding domain and regions responsible for self- 
association and homo-oligomerization. In this study, we demon- 
strate that the protein is posttranslationally modified by covalent 
attachment to the small ubiquitin-like modifier. The major 
sumoylation site was mapped to lysine 62 (K62) of the N protein.
Further expression and characterization of wild type N protein
and the K62A mutant reveal that sumoylation of the N protein
drastically promotes its homo-oligomerization, and plays certain 
roles in the N protein-mediated interference of host cell division. 
This is the first report showing that a coronavirus N protein 
undergoes posttranslational modification by sumoylation, and 
the functional implication of this modification in the formation 
of coronavirus ribonucleoprotein complex, virion assembly and
virus-host interactions.

© 2005 Federation of European Biochemical Societies. Published 
by Elsevier B.V. All rights reserved. 
1. Introduction


A novel coronavirus was identified as the etiological agent of 
severe acute respiratory syndrome (SARS) [6,13,17,21]. SARS 
coronavirus (SARS-CoV) is an enveloped virus with a
single-stranded, positive-sense RNA genome of 29.7 kb in length. In
SARS-CoV-infected cells, a 3’-coterminal nested set of nine
mRNA species, including the genome-length mRNA
(mRNA1) and eight subgenomic mRNA species (mRNA2-9),
is expressed. The genome-length mRNA1 encodes two
overlapping replicase proteins in the form of polyproteins 1a
and 1a/b, which are processed by virus-encoded proteinases
into at least 16 putative nonstructural proteins (NSP1-NSP16)
[24,29]. The four structural proteins, spike (S), envelope (E),
membrane (M) and nucleocapsid (N), are encoded
by subgenomic mRNA 2, 4, 5, and 9, respectively. In addition, 
eight putative nonstructural proteins, 3a, 3b, 6, 7a, 7b, 8a, 8b, 
9b, are encoded by subgenomic mRNA3, 6, 7, 8, and 9 [24,29]. 
All coronaviruses encode an extensively phosphorylated,
highly basic nucleocapsid (N) protein. It varies from 377 to 455
amino acids in length and has a high serine content (7-11%); these
serine residues are potential targets for phosphorylation. Although
the primary sequence conservation of the N proteins within the
genus is low, three structural domains can be identified based on
sequence comparisons [14]. Of these, the middle domain is a
potential RNA-binding domain, capable of binding both
coronavirus- and non-coronavirus-derived RNA sequences in
vitro [26]. The functions of domains I and III remain
unknown. However, putative motifs for ribosome binding and
nucleolar localization signals (NuLs) have been identified in 
domain III region [31]. In cells expressing the N protein, it 
localizes either to the cytoplasm alone or to the cytoplasm 
and nucleolus [9]. This nucleolar localization has been shown 
to be a common feature of the coronavirus family [31]. Multi- 
ple functions have been postulated for the coronavirus N pro- 
tein throughout the virus life cycle. Primarily, the protein can 
be associated with the genomic RNA to form a ribonucleopro- 
tein complex (RNP) and viral core [5,7,19,20]. It plays an 
important role in the replication of the genomic RNA [3], 
and in the transcription and translation of subgenomic RNAs 
(sgRNA) [1,26,28]. In addition, N protein might inhibit host 
cell proliferation or delay cell growth, possibly by disrupting 
cytokinesis [4,31]. The protein is generally phosphorylated at 
multiple positions and is one of the most abundant structural 
proteins [14]. It can also stimulate strong humoral and cellular 
immune response, making it a potential vaccine candidate [12]. 

Similar to the N proteins of other coronaviruses, SARS-CoV N
protein is a highly basic protein (Fig. 1A). Among its 422 amino
acids, there are several basic amino acid-rich regions that may
function as NuLs and RNA-binding motifs, and a serine/arginine
(S/R) rich motif (Fig. 1A). Sequence comparison and available
evidence showed that the N-terminal one third region (from 
amino acid 49 to 178) may contain the RNA binding domain 
and the C-terminal half (amino acid 213 to 422) may be 
responsible for self-association and homo-oligomerization of 
the protein (Fig. 1A) [11,27]. 

In this study, the SARS-CoV N protein was cloned and ex- 
pressed in bacterial and mammalian cells. In Escherichia coli 
BL21 cells, the protein was expressed as a single protein spe- 
cies. However, multiple protein bands with a wide range of 
molecular masses were detected when the N protein was 
expressed in mammalian cells, indicating that it may have 
undergone posttranslational modification in addition to phos- 
phorylation and may form homo-oligomers. Biochemical char- 
acterization and mutagenesis studies demonstrated that the 
Fig. 1. (A) Schematic representation of the SARS-CoV N protein, showing the SUMO-1 conjugation site (K62), three putative nucleolar localization
signals, the S/R-rich motif, the RNA-binding domain and self-association domain. (B) Analysis of the expression of N protein in bacterial and
mammalian cells. Plasmids pGEX-5X-1 and pGEX-N were transformed into E. coli BL21, and expressed by induction with IPTG. The GST (lane 1)
and GST-N (lane 2) proteins were affinity-purified and separated on an SDS-10% polyacrylamide gel. The proteins were visualized by staining the gel
with Coomassie blue. Total cell lysates prepared from HeLa cells transfected with pKT0-Flag empty plasmid (lane 3) and Flag-tagged N protein (lane
4) were separated by SDS-PAGE and analyzed by Western blotting with anti-Flag antibody. Numbers on the left and right indicate molecular
masses in kilodaltons.


protein was posttranslationally modified by covalent attachment
to the small ubiquitin-like modifier (SUMO). The sumoylation
site was mapped to lysine 62 (K62) of the N protein. Further
expression and characterization of wild type N protein and
K62A mutant revealed that sumoylation drastically promotes
homo-oligomerization of the protein. Sumoylation of the N
protein may also play certain roles in its interference of host
cell division. As self-association and homo-oligomerization of
the N protein are essential for the formation of viral RNP and
nucleocapsid assembly, sumoylation of this protein may play an
important regulatory role in the SARS-CoV replication cycles.


2. Materials and methods

2.1. Cell culture

HeLa cells were cultured in complete Dulbecco’s modified Eagle’s
medium (Invitrogen) supplemented with 10% newborn calf serum
(Sterile) and 1% penicillin/streptomycin (Invitrogen) and maintained
at 37 °C in humidified 5% CO2.


2.2. Transient expression of viral protein in mammalian cells

Constructs containing plasmid DNA under the control of a T7
promoter were transiently expressed in mammalian cells using a
vaccinia virus-T7 system. Briefly, semiconfluent monolayers of HeLa
cells were infected with 10 plaque forming units per cell of
recombinant vaccinia virus (vTF7-3), which expresses the T7 RNA
polymerase gene, for 2 h at 37 °C prior to transfection. The plasmid
DNA was transfected into vTF7-3-infected cells using Effectene
transfection reagent according to the manufacturer’s instructions
(Qiagen).


2.3. Western blot analysis

Total protein from transfected HeLa cells was lysed with 2x SDS
loading buffer and subjected to 10% SDS-PAGE. Protein was
transferred to PVDF membrane (Bio-Rad) by using a semi-dry transfer
cell (Bio-Rad, Trans-blot SD), and blocked overnight at room
temperature with 10% nonfat milk in PBS-T. The membrane was
probed with anti-SARS N, anti-Flag, or anti-Myc antibodies followed
by anti-mouse or anti-rabbit antibodies conjugated with
horseradish peroxidase (Sigma). Membrane-bound antibodies were
detected with the SuperSignal west pico chemiluminescence substrate
kit (Pierce).


2.4. Immunoprecipitation

Transiently transfected HeLa cells in 100-mm dishes were lysed in 1
ml of lysis buffer (150 mM NaCl, 1% NP-40, and 50 mM Tris-HCl, pH
8.0) with 0.5% protease inhibitor cocktail (Sigma). The lysates were
centrifuged at 12,000 rpm for 20 min at 4 °C. Anti-SUMO-1 (Zymed),
anti-Flag M2 (Stratagene), or anti-Myc (Biomed Diagnostics)
antibodies were added to the supernatants, which were incubated at
4 °C for 2 h. Protein-A agarose beads (40 µl) (KPL) were added to
the lysates and incubated with shaking for 1 h at 4 °C. The beads
were collected by centrifugation and washed three times with RIPA
buffer (150 mM NaCl, 1% NP-40, 0.5% sodium deoxycholate, 0.05% SDS,
and 50 mM Tris-HCl, pH 8.0). Proteins binding to the beads were
eluted by adding 2x SDS loading buffer and analyzed by Western
blotting with anti-Flag antibody.


2.5. Expression of GST fusion protein

The SARS-CoV N protein was cloned into pGEX-5X-1 (Amersham
Pharmacia Biotech) and expressed as GST-N fusion protein in E. coli
BL21 cells. Both GST-N and GST alone were purified by affinity
chromatography using glutathione-Sepharose 4B (Amersham Pharmacia
Biotech).
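
The lysis buffer of Section 2.4 is specified only by its final concentrations. As a minimal sketch of how such a recipe could be made up from concentrated stocks (C1 x V1 = C2 x V2), the following Python snippet computes stock and water volumes. The stock concentrations (5 M NaCl, 10% NP-40, 1 M Tris-HCl) and the helper name `stock_volumes` are assumptions for illustration and are not stated in the text.

```python
def stock_volumes(final_volume_ml, components):
    """Return the volume (ml) of each stock, plus water, for a buffer.

    components maps a name to (final_concentration, stock_concentration),
    both given in the same unit for that component (C1 * V1 = C2 * V2).
    """
    volumes = {}
    for name, (final_conc, stock_conc) in components.items():
        if final_conc > stock_conc:
            raise ValueError(f"{name}: stock is more dilute than the target")
        volumes[name] = final_volume_ml * final_conc / stock_conc
    # Whatever volume is not taken up by stocks is made up with water.
    volumes["water"] = final_volume_ml - sum(volumes.values())
    return volumes


# Final concentrations come from Section 2.4; stock concentrations are assumed.
LYSIS_BUFFER = {
    "NaCl (mM)": (150, 5000),            # assumed 5 M stock
    "NP-40 (%)": (1, 10),                # assumed 10% stock
    "Tris-HCl pH 8.0 (mM)": (50, 1000),  # assumed 1 M stock
}

if __name__ == "__main__":
    for name, ml in stock_volumes(10, LYSIS_BUFFER).items():
        print(f"{name}: {ml:.2f} ml")
```

The same function could be applied to the RIPA wash buffer by adding entries for sodium deoxycholate and SDS, again with whatever stock concentrations are actually on hand.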
2.6. Determination of cell division

HeLa cells grown on coverslips were transfected with appropriate
constructs expressing the N protein or SUMO-1, and fixed with 4%
paraformaldehyde. The number of cells undergoing cell division was
determined by counting a total of 300 cells expressing the green
fluorescent protein (GFP). Statistical analysis was performed using
Student’s t test, and P values less than 0.05 were considered to be
statistically significant.
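
To make the counting and testing procedure concrete, the sketch below converts counts per 300 GFP-positive cells into percentages and computes a two-sample Student's t statistic with pooled variance. The replicate counts are hypothetical (the text does not report individual replicates), and the comparison against a tabulated critical value stands in for a full P value calculation.

```python
from math import sqrt
from statistics import mean, variance


def percent_dividing(count, total=300):
    """Percentage of dividing cells among the GFP-positive cells counted."""
    return 100.0 * count / total


def student_t(sample_a, sample_b):
    """Two-sample Student's t statistic with pooled variance; returns (t, df)."""
    na, nb = len(sample_a), len(sample_b)
    df = na + nb - 2
    pooled = ((na - 1) * variance(sample_a) + (nb - 1) * variance(sample_b)) / df
    t = (mean(sample_a) - mean(sample_b)) / sqrt(pooled * (1 / na + 1 / nb))
    return t, df


# Hypothetical replicate counts of dividing cells per 300 GFP-positive cells.
WT_SUMO = [percent_dividing(c) for c in (93, 99, 108)]
K62A_SUMO = [percent_dividing(c) for c in (33, 39, 45)]

# Two-tailed critical value of t for P = 0.05 at 4 degrees of freedom.
T_CRIT_DF4 = 2.776

t_stat, dof = student_t(WT_SUMO, K62A_SUMO)
significant = dof == 4 and abs(t_stat) > T_CRIT_DF4

if __name__ == "__main__":
    print(f"t = {t_stat:.2f}, df = {dof}, significant at P < 0.05: {significant}")
```

With three replicates per group the test has 4 degrees of freedom; a different number of replicates would need a different critical value.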


2.7. Indirect immunofluorescence 

SARS-CoV N protein was transiently expressed in HeLa cells grown 
on 4-well chamber slides (IWAKI). After rinsing with phosphate-buf- 
fered saline (PBS), cells were fixed with 4% paraformaldehyde for 15 
min at room temperature and permeabilized with 0.2% Triton 
X-100, followed by incubation with specific antibodies diluted in fluo- 
rescence dilution buffer (PBS with 5% newborn calf serum) at room 
temperature for 2 h. Cells were then washed with PBS and incubated 
with FITC-conjugated anti-rabbit or anti-mouse secondary antibodies 
(Dako) in fluorescence dilution buffer at 4 °C for 1 h before mounting.


2.8. Construction of plasmids 

Plasmid pcDNA3-N, which covers the SARS-CoV N sequence, was
constructed by cloning an EcoRI/NotI-digested PCR fragment into
EcoRI/NotI-digested pcDNA3.1(+). The PCR fragment was generated
using primers (5'-CGGAATTCCGATGTCTGATAATGGACCC-3')
and (5'-AATAAATAGCGGCCGCTGCCTGAGTTGAATC-3').
pFlag-N was created by cloning a PstI/EcoRI-digested PCR fragment
into PstI/EcoRI-digested pKT0-Flag. The PCR fragment was generated
using primers (5'-AACTGCAGCATGTCTGATAATGGACCCC-3')
and (5'-CGGAATTCCGTTATGCCTGAGTTGAATCAGC-3').
Plasmid pEGFP-N was generated by cloning an EcoRI/BamHI-digested
PCR fragment into EcoRI/BamHI-digested pEGFP-N1 (Clontech).
The two primers used to generate the PCR fragment were
(5'-CGGAATTCCGATGTCTGATAATGGACCC-3') and
(5'-CGGGATCCCGTGCCTGAGTTGAATCAGC-3'). Plasmid pGEX-N
was made by cloning a BamHI/EcoRI-digested PCR fragment into
BamHI/EcoRI-digested pGEX-5X-1 (Pharmacia). The two primers
used were (5'-CGGGATCCCGATGTCTGATAATGGACCC-3') and
(5'-CGGAATTCTGCCTGAGTTGAATCAGC-3'). The K62A
mutation was introduced by two rounds of PCR as described in Liu
et al. [16]. All constructs were confirmed by automated nucleotide
sequencing.

SUMO-1 was amplified from human cDNA derived from HeLa cells
by PCR with primers (5'-TATCGGATCCCATGTCTGACCAGGCAAAACC-3')
and (5'-CGGATCCTCGAGCTAAACTGTTGAATGACCCCCCGT-3').
The purified PCR product was digested
with BamHI and XhoI and cloned into BamHI/XhoI-digested
pcDNA3.1(+) to generate pcDNA3-SUMO-1. The construct was
confirmed by automated nucleotide sequencing.
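
As a quick consistency check on the cloning strategy, each primer should carry the recognition site of the enzyme used to cut its end of the PCR fragment. The following sketch searches the primers listed in this section (with spaces and line breaks removed) for the standard recognition sequences; the table layout and function name are illustrative choices, not part of the original methods.

```python
# Recognition sequences of the restriction enzymes named in Section 2.8.
# All five sites are palindromic, so searching the primer as written suffices.
SITES = {
    "EcoRI": "GAATTC",
    "NotI": "GCGGCCGC",
    "PstI": "CTGCAG",
    "BamHI": "GGATCC",
    "XhoI": "CTCGAG",
}

# (construct, enzyme expected in the primer, primer sequence 5'->3')
PRIMERS = [
    ("pcDNA3-N", "EcoRI", "CGGAATTCCGATGTCTGATAATGGACCC"),
    ("pcDNA3-N", "NotI", "AATAAATAGCGGCCGCTGCCTGAGTTGAATC"),
    ("pFlag-N", "PstI", "AACTGCAGCATGTCTGATAATGGACCCC"),
    ("pFlag-N", "EcoRI", "CGGAATTCCGTTATGCCTGAGTTGAATCAGC"),
    ("pEGFP-N", "EcoRI", "CGGAATTCCGATGTCTGATAATGGACCC"),
    ("pEGFP-N", "BamHI", "CGGGATCCCGTGCCTGAGTTGAATCAGC"),
    ("pGEX-N", "BamHI", "CGGGATCCCGATGTCTGATAATGGACCC"),
    ("pGEX-N", "EcoRI", "CGGAATTCTGCCTGAGTTGAATCAGC"),
    ("pcDNA3-SUMO-1", "BamHI", "TATCGGATCCCATGTCTGACCAGGCAAAACC"),
    ("pcDNA3-SUMO-1", "XhoI", "CGGATCCTCGAGCTAAACTGTTGAATGACCCCCCGT"),
]


def has_site(primer, enzyme):
    """True if the primer contains the enzyme's recognition sequence."""
    return SITES[enzyme] in primer.upper()


if __name__ == "__main__":
    for construct, enzyme, primer in PRIMERS:
        status = "ok" if has_site(primer, enzyme) else "MISSING"
        print(f"{construct:15s} {enzyme:6s} {status}")
```

A check of this kind only confirms that the site is present; it says nothing about reading frame or about additional sites elsewhere in the template.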


3. Results 


3.1. Expression of SARS-CoV N protein in bacterial and 
mammalian cells 

To study its biochemical properties and functions in viral 
replication, virion assembly and virus-host interaction, we
cloned and expressed the SARS-CoV N protein in bacterial 
and mammalian cells. As seen in Fig. 1B, expression of the 
protein in E. coli BL21 cells as a GST fusion protein showed 
the detection of a single band of approximately 70 kDa, repre- 
senting the GST-N fusion protein (Fig. 1B, lane 2). The pro- 
tein could be purified by using the GST resin (Fig. 1B, lane 
2). Expression of the N protein tagged with the nine amino 
acid Flag tag at its N-terminus in mammalian cells showed 
the detection of multiple bands instead of a single protein band 
(Fig. 1B, lane 4). Similar to the recently reported detection of 
three major isoforms [15], a major protein species of approxi- 
mately 48 kDa and two slightly less abundant species, which 
migrate more rapidly than the 48-kDa band, were detected 
(Fig. 1B, lane 4). These three isoforms may represent the
full-length and posttranslationally modified forms of the N
protein. In addition, three other species with approximate
molecular masses of 175, 85, and 82 kDa, which may represent
oligomers of the N protein, were sometimes detectable
under reducing conditions (Fig. 1B, lane 4).


3.2. Posttranslational modification of SARS-CoV N protein by 
sumoylation 

The detection of multiple protein species with a wide range 
of molecular masses when the N protein was expressed in 
mammalian cells indicates that the protein may undergo 
other posttranslational modification, in addition to the 
known phosphorylation. As N protein does not contain 
any cysteine residue, the detection of potential oligomers sug- 
gests that it may form higher-order structures through other 
interactions. Sumoylation is one of the posttranslational 
modifications that affect the migration of the protein. To 
detect if the N protein is modified by sumoylation, cells 
transfected with pFlag-N were lysed either with a lysis buffer 
containing two isopeptidase inhibitors, iodoacetamide (IAA, 
10 mM) and N-ethylmaleimide (NEM, 20 mM), or with the 
Laemmli protein gel loading buffer preheated to 80°C, and 
subjected to Western blot analysis with anti-Flag antibody. 
As can be seen in Fig. 2A, in addition to the three major iso- 
forms of N protein that were detected under all conditions, a 
protein species of approximately 65 kDa was detected in cell 
lysates prepared with lysis buffer containing IAA and NEM 
(Fig. 2A, lanes 2, 3, and 5). Occasionally, a band of approx- 
imately 55kDa was also detected in cell lysates prepared 
with buffer containing IAA and NEM (Fig. 2A, lane 3). 
Co-expression of N protein with SUMO-1 led to the detection
of significantly more of the 65-kDa species (Fig. 2A, lanes 2
and 3). The 65-kDa band was also detected when cells were
lysed directly with preheated SDS loading buffer (Fig. 2A,
lane 6). During the course of this study, we noted that the
65-kDa band could also be efficiently detected under
nonreducing conditions (Fig. 2A, lanes 5 and 6). Under both
reducing and nonreducing conditions, the three major isoforms
and the 65-kDa band migrate at the same positions.
This allows detection of N proteins in the subsequent
co-immunoprecipitation experiments using SDS-PAGE under
nonreducing conditions. Nonreducing conditions were chosen
for these assays because the IgG heavy chain would mask
the three major N isoforms under reducing conditions.

The molecular mass of 65 kDa and its biochemical properties
suggest that this species may represent the sumoylated form
of the N protein. To confirm this possibility further, cell lysates
were subjected to immunoprecipitation with either anti-Flag or 
anti-SUMO-1 antibody, and then analyzed by Western blot- 
ting with anti-Flag antibody. The results showed that anti-Flag 
antibody specifically precipitated the three major isoforms of 
N protein from cells transfected with pFlag-N (Fig. 2B, lanes 
2 and 3). In addition, the 65-kDa species was also detected 
(Fig. 2B, lanes 2 and 3). Analysis of the anti-SUMO-1 precip- 
itates by Western blotting with anti-Flag antibody showed that 
only the 65-kDa band was detected (Fig. 2B, lanes 5 and 6). 
Once again, co-expression of N protein with SUMO-1 greatly 
increased the detection of the 65 kDa species (Fig. 2B, lanes 2, 
3, 5, and 6). No N protein bands were detected from cells 
transfected with SUMO-1 alone (Fig. 2B, lanes 1 and 4). These 
Fig. 2. Modification of SARS-CoV N protein by SUMO-1. (A) Analysis of sumoylation of N protein by Western blotting. Cell lysates from
HeLa cells overexpressing either the Flag-tagged N protein alone (lanes 1 and 2) or together with SUMO-1 (lanes 3, 4, 5, and 6) were prepared either
in the presence (lanes 2, 3 and 5) or absence (lanes 1 and 4) of the isopeptidase inhibitors IAA and NEM or by direct lysis in preheated SDS loading
buffer (lane 6). The polypeptides were separated by SDS-PAGE under either reducing (lanes 1-3) or non-reducing (lanes 4-6) conditions and
analyzed by Western blotting with anti-Flag antibody. The three major isoforms of N protein are indicated by brackets and the major SUMO-1
modified form of N protein is indicated by asterisks. Numbers on the left indicate molecular masses in kilodaltons. (B) Analysis of sumoylation of N
protein by immunoprecipitation and Western blotting. Total cell lysates were prepared, in the presence of the isopeptidase inhibitors IAA and NEM,
from HeLa cells expressing SUMO-1 (lanes 1 and 4), Flag-N (lanes 2 and 5), and Flag-N + SUMO-1 (lanes 3 and 6), and immunoprecipitated with
either anti-Flag (lanes 1, 2 and 3) or anti-SUMO-1 antibody (lanes 4, 5, and 6). The immunoprecipitated proteins were separated by SDS-PAGE
under nonreducing conditions and analyzed by Western blotting with anti-Flag antibody. The major SUMO-1 modified forms of N protein are
indicated by asterisks, and the immunoglobulin is indicated by Ig.


results confirmed that the 65-kDa band represents the
sumoylated N protein.


3.3. Mapping of the sumoylation site on SARS-CoV N protein 

The consensus motif for sumoylation has been defined as a
tetrapeptide ΨKXE (where Ψ is usually a hydrophobic residue,
with exceptions, and X is any amino acid) that surrounds the
acceptor lysine in target proteins [23]. Analysis of the N protein
sequence showed that it contains 27 lysine residues. One
lysine residue at amino acid position 62, K62, lies roughly 
within the consensus SUMO-1 modification sequence (GKEE) 
(Fig. 1A). To determine whether this lysine was responsible for 
the modification of N protein by sumoylation, it was mutated 
to an Ala by site-directed mutagenesis. Proteins extracted from 
cells transfected with wild-type N and K62A mutant were 
immunoblotted with anti-Flag antibody. As shown in Fig. 3, 
similar amounts of the three isoforms of N protein were de- 
tected from cells transfected with either wild type or mutant 
N constructs (Fig. 3, lanes 1 and 2). The 65 kDa sumoylated 
band was detected from cells transfected with wild-type N pro- 
tein only (Fig. 3, lane 1); no 65 kDa sumoylated form was de- 
tected from cells expressing the K62A mutant (Fig. 3, lane 2). 
These results demonstrate that the K62 residue is the major 
sumoylation site of N protein. 
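
The motif search described above can be sketched as a regular-expression scan. The snippet below reports lysines in a strict ΨKXE context and, separately, in a relaxed KXE context, which is the sense in which a sequence such as GKEE lies "roughly within" the consensus. The hydrophobic residue set and the toy peptide are assumptions made for illustration; the peptide is not the N protein sequence.

```python
import re

# Assumed residue set for the hydrophobic (psi) position; definitions vary.
HYDROPHOBIC = "VILMF"


def sumo_candidates(seq, strict=True):
    """Return 1-based positions of lysines in a candidate sumoylation context.

    strict=True  requires psi-K-X-E (hydrophobic residue before the lysine).
    strict=False requires only K-X-E.
    """
    if strict:
        pattern = rf"(?<=[{HYDROPHOBIC}])K(?=.E)"
    else:
        pattern = r"K(?=.E)"
    return [m.start() + 1 for m in re.finditer(pattern, seq.upper())]


# Hypothetical peptide containing one GKEE-like and one VKQE-like context.
TOY_PEPTIDE = "SGKEELVKQEAT"

if __name__ == "__main__":
    print("relaxed KXE:", sumo_candidates(TOY_PEPTIDE, strict=False))
    print("strict psi-KXE:", sumo_candidates(TOY_PEPTIDE, strict=True))
```

A scan like this only nominates candidate lysines; as in the text, the assignment of the acceptor site rests on mutagenesis (here, K62A).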


3.4. Promotion of homo-oligomerization of SARS-CoV N 
protein by sumoylation 

It has been well documented that the ability of viral nucleo- 
capsid protein to interact with itself to form homo-oligomers is 
fundamental to the process of viral particle assembly. Recent 
studies showed that SARS-CoV N protein exhibits intrinsic 
properties of self-interaction [8,27]. Multimerization of N pro- 
tein was observed both in vitro and in vivo [8,27]. To study the 
effects of sumoylation on the homo-oligomerization of N pro- 
tein, cells expressing N protein alone or together with SUMO- 
1 were analyzed. As shown in Fig. 4, Western blot analysis of 
Fig. 3. Mapping of the major sumoylation site on SARS-CoV N
protein. Cell lysates from HeLa cells overexpressing wild type (lane 1) 
and K62A mutant N protein (lane 2) together with SUMO-1 were 
prepared in the presence of IAA and NEM. The polypeptides were 
separated by SDS-PAGE under nonreducing conditions and analyzed 
by Western blotting with anti-Flag antibody. The three major isoforms 
of N protein are indicated by brackets and the major SUMO-1 
modified form of N protein is indicated by asterisks. Numbers on the 
left indicate molecular masses in kilodaltons. 


cells expressing wild type N protein showed the detection of 
the three major isoforms of N protein and the 65 kDa sumoy- 
lated bands (Fig. 4A and B, lanes 1 and 2). In addition, two 
bands of approximately 85 and 175 kDa were detected (Fig. 
4A and B, lanes 1 and 2). Based on their apparent molecular 
masses, they may represent dimers and tetramers, respectively, 
of the N protein. 
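
The assignment of the 85- and 175-kDa bands to dimers and tetramers rests on simple arithmetic against the 48-kDa major isoform. The sketch below makes that estimate explicit by rounding the band-to-monomer mass ratio to the nearest integer. Apparent masses on SDS-PAGE are approximate, so the ratios are not exact integers and the result should be read as a rough indication only.

```python
# Apparent mass of the major N isoform reported in Section 3.1.
MONOMER_KDA = 48.0


def estimate_subunits(band_kda, monomer_kda=MONOMER_KDA):
    """Return (nearest integer subunit count, raw mass ratio) for a band."""
    ratio = band_kda / monomer_kda
    return round(ratio), ratio


if __name__ == "__main__":
    for band in (85, 175):
        n, ratio = estimate_subunits(band)
        print(f"{band} kDa band: ratio {ratio:.2f}, nearest subunit count {n}")
```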

The effects of sumoylation on the formation of these oligo- 
mers were then analyzed by expression of wild type and K62A 
mutant constructs in cells in the presence of SUMO-1. The 
results showed that co-expression of wild type N protein with 
SUMO-1 dramatically increased the detection of the 65-kDa 
Fig. 4. Analysis of the homo-oligomerization of SARS-CoV N
protein. (A) HeLa cells overexpressing wild type N protein only (lane
1) or together with SUMO-1 (lane 2) were lysed in the absence (lane 1)
or presence of IAA and NEM (lane 2). The polypeptides were
separated by SDS-PAGE under reducing conditions and analyzed by
Western blotting with anti-Flag antibody. The three major isoforms of
N protein are indicated by brackets and the major SUMO-1 modified
form of N protein is indicated by asterisks. The dimers are also
indicated. Numbers on the left indicate molecular masses in
kilodaltons. (B) HeLa cells overexpressing wild type N protein only
(lanes 1 and 2), wild type + SUMO-1 (lanes 3 and 4) or the K62A
mutant N protein + SUMO-1 (lane 5) were lysed in the presence of IAA
and NEM (lanes 1, 3 and 5) or with preheated SDS loading buffer (lanes 2
and 4). The polypeptides were separated by SDS-PAGE under
nonreducing conditions and analyzed by Western blotting with
anti-Flag antibody. The three major isoforms of N protein are indicated by
brackets and the major SUMO-1 modified form of N protein is
indicated by asterisks. The dimers and tetramers are also indicated.
Numbers on the left indicate molecular masses in kilodaltons.


sumoylated band and the 85-kDa/175-kDa oligomers (Fig. 4B, 
lanes 3 and 4). Co-expression of the K62A mutant with 
SUMO-1, once again, showed no detection of the 65-kDa 
sumoylated N protein (Fig. 4B, lane 5). Interestingly, only a 
trace amount of the 85- and 175-kDa species was detected 
(Fig. 4B, lane 5). These results suggest that abolishing
sumoylation of the N protein by mutating the K62 sumoylation
site significantly decreases homo-oligomerization of the
protein.


3.5. Further characterization of sumoylation-mediated homo- 
oligomerization of SARS-CoV N protein 

The effect of sumoylation on homo-oligomerization of N
protein was further characterized by two independent
co-immunoprecipitation experiments. First, cells expressing wild
type and K62A mutant constructs were lysed with buffer
containing IAA and NEM, and immunoprecipitated with anti-Flag
antibody. The immunoprecipitated proteins were separated
by SDS-PAGE and analyzed by Western blotting with
anti-Flag antibody. As shown in Fig. 5A, similar amounts of the
three isoforms of N protein were detected from cells expressing
the wild type and mutant constructs (Fig. 5A, lanes 2 and 3).
However, significantly more 85-kDa dimers were detected in
cells expressing the wild type construct than in cells
expressing the mutant (Fig. 5A, lanes 2 and 3).

Second, the Flag-tagged wild type and mutant N constructs 
were co-expressed with a c-Myc-tagged wild type N construct. 
Cells were then lysed with buffer containing IAA and NEM, 
Fig. 5. Further analysis of the homo-oligomerization of SARS-CoV N
protein. (A) HeLa cells overexpressing the empty pKT0-Flag (lane 1),
wild type N protein (lane 2), or the K62A mutant N protein (lane 3) 
were lysed in the presence of IAA and NEM. The lysates were 
immunoprecipitated with anti-Flag antibody. The precipitated poly- 
peptides were separated by SDS-PAGE under nonreducing conditions 
and analyzed by Western blotting with anti-Flag antibody. The three 
major isoforms of N protein, the N protein dimer, and the immuno- 
globulin are indicated. Numbers on the left indicate molecular masses 
in kilodaltons. (B) HeLa cells overexpressing the Flag-tagged wild type 
N protein alone (lane 1), the c-Myc-tagged wild type N protein alone 
(lane 2), the Flag-tagged wild type and c-Myc-tagged wild type N 
protein (lane 3), and the Flag-tagged K62A mutant and c-Myc-tagged 
wild type N protein (lane 4) were lysed in the presence of IAA and 
NEM. Polypeptides were immunoprecipitated with anti-Myc anti- 
body, separated by SDS-PAGE under nonreducing conditions, and 
analyzed by Western blotting with anti-Flag antibody. The N protein 
dimer and the immunoglobulin are indicated. A band migrating at 
approximately 50 kDa, which may represent the antibody heavy chain, 
is also indicated. (C) HeLa cells overexpressing the Flag-tagged wild 
type N protein alone (lanes 1 and 5), the c-Myc-tagged wild type N 
protein alone (lanes 2 and 6), the Flag-tagged wild type and c-Myc- 
tagged wild type N protein (lanes 3 and 7), and the Flag-tagged K62A 
mutant and c-Myc-tagged wild type N protein (lanes 4 and 8) were 
lysed in the presence of IAA and NEM. Polypeptides were separated 
by SDS-PAGE under nonreducing conditions, and analyzed by 
Western blotting with either anti-Flag antibody (lanes 1-4), or anti- 
Myc antibody (lanes 5-8). The three major isoforms of N protein and 
the dimer are indicated. 


and immunoprecipitated with anti-Myc antibody. The
immunoprecipitated proteins were separated by SDS-PAGE and
analyzed by Western blotting with anti-Flag antibody. As
shown in Fig. 5B, the 85-kDa dimers were readily detected in
cells co-expressing the Flag-tagged wild type and Myc-tagged 
wild type N protein (Fig. 5B, lane 3). Only a trace amount of 
the 85-kDa dimers was detected in cells co-expressing the 
Flag-tagged K62A mutant and Myc-tagged wild type N pro- 
tein (Fig. 5B, lane 4). In addition, a band migrating at approx- 
imately 50 kDa position was consistently detected (Fig. 5B). It 
might represent the heavy chain of the anti-Myc antibody 
used. 

To make sure that similar levels of N protein were expressed 
in the transfected cells, total cell lysates were analyzed by Wes- 
tern blotting with anti-Flag (Fig. 5C, lanes 1-4) and anti-Myc 
(Fig. 5C, lanes 5-8) antibodies. The results showed that the 
expression levels of Flag-tagged wild type and mutant N
protein and the Myc-tagged N protein are approximately the same
(Fig. 5C, lanes 1-4 and 5-8). Interestingly, Western blot
analysis of cells co-expressing Flag- and Myc-tagged wild type N
protein with anti-Flag antibody readily detected
the 85- and 175-kDa oligomers (Fig. 5C, lane 3). Analysis of
the same cell lysates with anti-Myc antibody, however, led to 
much less detection of the two forms (Fig. 5C, lane 7). The 
two bands were only detectable after prolonged exposure of 
the gel (data not shown). 


3.6. Effects of sumoylation on the subcellular localization of 
SARS-CoV N protein 

Subcellular distribution of N protein was first studied by 
cloning and expressing wild type and K62A mutant N protein 
as a fusion protein with the enhanced green fluorescent protein 
(EGFP). The plasmid was transfected into HeLa cells and 
incubated at 37°C for 36h. The majority of the SARS-CoV 
N-EGFP fusion protein was observed to be distributed 
throughout the cytoplasm (Fig. 6B). A certain proportion of 
the fusion protein was also observed to be localized to the
nucleolus (Fig. 6B). Similar cytoplasmic localization was 
observed when K62A-EGFP fusion protein was expressed in 
HeLa cells (Fig. 6C). However, much less, if any, nucleolar 
localization was seen in cells expressing this mutant construct 
(Fig. 6C). 

The wild type and K62A mutant N proteins were then cloned
into pcDNA3.1(+) and expressed in HeLa cells. After
incubation at 37 °C for 18 h, cells were analyzed by indirect
immunofluorescence using rabbit anti-SARS-CoV N antiserum,
followed by FITC-labeled goat anti-rabbit antibody. Similar 
cytoplasmic localization pattern to the N-EGFP fusion pro- 
tein was observed in cells expressing wild type and mutant 
N protein (Fig. 6E and F). In some cells, wild type N protein 
exhibited typical nucleolar localization (Fig. 6E). Once 
again, less obvious nucleolar staining was observed in cells 
expressing the K62A mutant (Fig. 6F). These results suggest 
that sumoylation of the N protein might affect its nucleolar 
localization. 


3.7. Nucleolar localization of SARS-CoV N protein 

The nucleolar localization pattern of wild type N protein
observed above was confirmed by its colocalization with
fibrillarin in the nucleolus. As shown in Fig. 7, a similar
cytoplasmic and nuclear localization pattern was observed in
cells expressing the Flag-tagged N protein (Fig. 7A).
Immunofluorescent staining with anti-fibrillarin antibodies showed
typical nucleolar staining of both transfected and untransfected
cells (Fig. 7B). Interestingly, strong cytoplasmic staining
was also observed in cells expressing the N protein using
the same antibodies (Fig. 7B). The staining patterns
aligned well with the patterns observed with anti-Flag
antibodies (Fig. 7C). These results confirm the nucleolar


Fig. 6. Subcellular localization of SARS-CoV N protein. HeLa cells expressing EGFP (A), N-EGFP (B) and K62A-EGFP fusion protein (C) were 
detected directly under the fluorescence microscope at 36h posttransfection. Indirect immunofluorescent staining of HeLa cells transfected with 
empty plasmid (D), wild type N protein (E) and K62A mutant (F), was carried out at 18h posttransfection with rabbit anti-SARS N antisera and 
FITC-labeled goat anti-rabbit antibodies. The multinucleated cells are indicated by arrows. 
Fig. 7. Nucleolar localization of SARS-CoV N protein. Indirect immunofluorescent staining of HeLa cells expressing the Flag-tagged N protein was
carried out at 18h posttransfection with mouse anti-Flag (A) and rabbit anti-fibrillarin (B) antisera. The N protein was then detected by TRITC- 
conjugated anti-mouse secondary antibodies and fibrillarin was detected by FITC-labeled anti-rabbit antibodies. (C) represents the merged images. 
All images were taken using a Zeiss LSM510 META laser scanning confocal microscope. 


localization of the N protein, and further suggest that
N protein may physically interact with fibrillarin, resulting 
in the retention of fibrillarin in the cytoplasm of cells 
expressing the N protein. 


3.8. Effects of sumoylation on SARS-CoV N protein-mediated 
disruption of host cell division 

Similar to other coronavirus N proteins, overexpression of 
SARS-CoV N protein in mammalian cells disrupts cell 
division, as a certain proportion of cells expressing the protein 
at various time points posttransfection were apparently 
undergoing cell division (Fig. 6). Furthermore, far fewer 
multinucleated cells were observed among cells expressing the K62A mutant 
(Fig. 6). To quantitate the multinucleated cells, HeLa cells 
transfected with pEGFP, pEGFP-N, pEGFP-N + SUMO-1, 
and pEGFP-N(K62A) + SUMO-1 were examined under the 
fluorescence microscope at 24, 36, 48 and 60 h posttransfection, 
respectively, and the multinucleated cells were counted 
among 300 cells expressing GFP. As shown in Fig. 8, over 
25% of HeLa cells expressing pEGFP-N were observed to be 
undergoing cell division at all time points. The percentages 
of cells undergoing cell division increased to 31-36% among 
GFP-positive cells when pEGFP-N was co-expressed with 
SUMO-1. However, the percentages of cells undergoing cell 
division were markedly reduced to 11-15% in cells 
overexpressing the K62A mutant and SUMO-1 (Fig. 8), significantly 
lower (P < 0.05) than the percentages of multinucleated cells observed 
when the wild type construct was expressed. In a control experiment, 
the percentages of multinucleated cells among cells overexpressing 
GFP only were between 2% and 3%. These results indicate that 
sumoylation of the N protein may play a role in its 
interference with and disruption of host cell division. 
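
The quantitation described above (multinucleated cells counted among 300 GFP-positive cells, reported as a percentage with S.D. over three repeated experiments) can be illustrated with a minimal sketch. The function names and the example counts are hypothetical and are not data from this study; the use of the sample standard deviation is also an assumption, since the text does not state which estimator was used.

```python
from statistics import mean, stdev

CELLS_COUNTED = 300  # GFP-positive cells scored per sample, as stated in the text


def percent_multinucleated(multinucleated, total=CELLS_COUNTED):
    """Return the percentage of multinucleated cells among the counted cells."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * multinucleated / total


def summarize(counts, total=CELLS_COUNTED):
    """Return (mean, sample S.D.) of the percentages over repeated experiments."""
    percentages = [percent_multinucleated(c, total) for c in counts]
    return mean(percentages), stdev(percentages)


# Hypothetical counts from three repeats of one condition (illustration only).
example_counts = [75, 78, 81]
avg, sd = summarize(example_counts)
print(f"{avg:.1f}% +/- {sd:.1f}%")
```

Each condition and time point would be summarized in the same way before the groups are compared statistically.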


Fig. 8 plot legend: pEGFP-N1; pEGFP-N; pEGFP-N + pcDNASUMO-1; pEGFP-N K62A + pcDNASUMO-1. Horizontal axis: hours post-transfection. 


Fig. 8. Effects of sumoylation of SARS-CoV N protein on its interference of host cell division. Percentages of multinucleated cells among HeLa cells 
expressing EGFP, N-EGFP, N-EGFP + SUMO-1, and K62A-EGFP + SUMO-1 were calculated by counting the multinucleated cells among 300 
green cells under the fluorescence microscope. The percentages and S.D. are results of three repeated experiments. 
4. Discussion 


In this study, we showed that, in addition to phosphorylation, 
the SARS-CoV N protein was modified by covalent 
attachment of SUMO to its lysine 62 (K62) residue. The evidence 
provided demonstrates that sumoylation may promote 
homo-oligomerization of the protein. It may also play a 
role in the N protein-mediated interference with host cell 
division. 

Sumoylation is a highly regulated process in all eukaryotes, 
involved in diverse regulatory events, such as nuclear transport, 
transcriptional regulation, chromosome segregation and 
cell-cycle control [18,23,30]. Conjugation of SUMO to a protein 
involves the formation of an isopeptide bond between the 
C-terminal glycine of SUMO and the ε-amino group of a lysine 
residue in the target protein. Although SUMO-1 has an estimated 
molecular mass of approximately 12 kDa, earlier studies 
demonstrated that many SUMO-conjugated proteins 
show a size increase of approximately 20 kDa after 
conjugation of one SUMO-1 molecule. SUMO-conjugated 
proteins are typically unstable and can be rapidly hydrolyzed 
by SUMO-1 hydrolase during purification. This modification 
usually alters or regulates the main function of the target 
proteins. 

Coronavirus N protein is a multi-functional protein [14,31]. 
Its most prominent function is 
to wrap up the RNA genome to form the RNP and assemble 
into the nucleocapsid core, owing to its RNA-binding activities 
and self-association properties. Studies with other coronavirus 
N proteins have mapped the RNA binding domain to 
the N-terminal one-third of the protein [14]. A basic 
amino acid stretch between amino acids 238 and 293 may 
be responsible for the RNA binding activities of the corona- 
virus infectious bronchitis virus N protein [34]. Interactions 
between N protein and RNA are generally required for 
encapsidation of viral genomic RNA. It is possible that 
RNA could promote N-N interactions by neutralizing charge 
repulsions between the two stretches of basic amino acids. As 
the K62 residue is located in the putative RNA-binding 
domain of the SARS-CoV N protein, is the reduced homo- 
oligomerization of the K62A mutant reported in this study 
due to its loss of the RNA binding activity? Although we 
do not know if this mutation could affect the RNA-binding 
activities of the protein, co-expression of the Flag-tagged 
K62A mutant with the Myc-tagged wild type N protein 
resulted in the detection of markedly fewer 85-kDa dimers than 
in cells co-expressing the Flag- and Myc-tagged wild type N 
protein. As two different antibodies were used in the 
immunoprecipitation and Western blot analyses, this virtually rules 
out the possibility that the detection of fewer 85- and 175-kDa 
oligomers in Western blot studies involving a single antibody 
is due to a weaker RNA binding activity of the mutant 
construct. 

Self-association and homo-oligomerization are other 
essential properties of the coronavirus N protein. Two recent 
studies have shown that the S/R-rich motif and the C-terminal 
209 amino acids are essential for self-association and multi- 
merization of the SARS-CoV N protein [8,27]. The findings 
in this report indicate that the N protein is capable of forming 
dimers and higher order multimers. Similar to the equivalent 
protein of equine arteritis virus and simian hemorrhagic fever 
virus, SARS N protein does not contain any cysteine in its 422 
amino acid residues. This excludes the possibility that disulfide 
bonds are the primary force that mediates the initial N-N 
interactions. In fact, disulfide bonds do not appear to be relevant 
until the virus enters the secretory pathway (ER and 
Golgi) and/or egresses from the cell, both of which occur after 
core particle assembly. Data reported in this study 
demonstrate that sumoylation of the SARS-CoV N protein 
dramatically enhances the homo-oligomerization of the protein. 
Promotion of protein oligomerization by sumoylation has 
been speculated for a pathogenic protein, Huntingtin [25]. Since 
self-association and homo-oligomerization of N protein are 
essential for the assembly of the nucleocapsid core, this suggests that 
sumoylation would play an important role in the SARS-CoV 
replication cycle. Systematically testing this possibility would rely 
on the availability of an infectious cloning system, as developed 
by Yount et al. [33]. 

The apparent molecular weight of the dimer of the N protein 
detected in this report is 85 kDa, suggesting that it does not 
contain the SUMO conjugate. The failure to detect the sumoy- 
lated dimer in this study is unexpected, considering that 
sumoylation was shown to promote dimerization of the N pro- 
tein. Two possibilities have been considered. First, sumoyla- 
tion is a highly reversible process. The current data showed 
that only a small proportion of the N protein was dimerized 
compared to the monomers, and a certain proportion of the 
sumoylated dimer may be reversed during sample preparation 
and detection. The combination of these two factors would 
hamper the detection of the sumoylated dimers by the approaches 
used. The second possibility is that the sumoylated 
N protein may not be directly involved in the formation of 
dimers and other oligomers. Instead, it may target the N protein 
to different cellular compartments and facilitate the oligomer- 
ization of the N protein. Further studies are required to ad- 
dress these possibilities. 
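
The size reasoning behind this interpretation can be sketched as simple arithmetic, using only the apparent masses quoted in this report (a 65-kDa sumoylated band, an 85-kDa dimer and a shift of roughly 20 kDa per SUMO-1). The monomer mass derived below and the assumption that apparent gel masses add linearly are simplifications made for illustration; the function and variable names are hypothetical.

```python
SUMO_SHIFT_KDA = 20.0          # approximate apparent shift per SUMO-1, from the text
SUMOYLATED_MONOMER_KDA = 65.0  # apparent mass of the sumoylated N band, from the text
OBSERVED_DIMER_KDA = 85.0      # apparent mass of the detected dimer, from the text


def expected_mass(monomer_kda, subunits, sumo_count, shift_kda=SUMO_SHIFT_KDA):
    """Apparent mass of an oligomer, assuming subunit and SUMO shifts simply add."""
    return subunits * monomer_kda + sumo_count * shift_kda


# Assumption: unmodified monomer = sumoylated band minus one SUMO-1 shift.
monomer_kda = SUMOYLATED_MONOMER_KDA - SUMO_SHIFT_KDA

candidates = {
    "unmodified dimer": expected_mass(monomer_kda, 2, 0),
    "dimer with one SUMO-1": expected_mass(monomer_kda, 2, 1),
    "dimer with two SUMO-1": expected_mass(monomer_kda, 2, 2),
}

# Pick the candidate whose expected mass lies closest to the observed band.
closest = min(candidates, key=lambda name: abs(candidates[name] - OBSERVED_DIMER_KDA))
print(closest, candidates[closest])
```

Under these assumptions the observed band lies closest to the unmodified dimer, in line with the interpretation given above, although apparent masses on gels are only approximate.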

Site-directed mutagenesis studies mapped the lysine 62 (K62) residue 
as a major site for covalent attachment of SUMO to the protein, 
as the 65-kDa sumoylated band could not be detected when 
the mutant construct was expressed in cells. We do not know if 
other minor sumoylation sites exist in the SARS-CoV N 
protein. Potential sumoylation at these minor positions could 
compensate for the effect of the K62A mutation, and complicate the 
interpretation of the data generated from functional studies, 
such as the partial interference with cell division by the K62A mutant 
N protein. Sumoylation of proteins at multiple sites was recently 
reported for several viral and host proteins. For example, 
the Epstein-Barr virus Rta protein was shown to be sumoylated 
at three alternating sites [2]. As the SARS-CoV N protein 
contains a total of 27 lysine residues and no other lysine residue 
is located in a consensus sequence context for sumoylation, 
it would be difficult to further define these sites, if any, 
by a conventional mutagenesis approach. 
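
A scan for lysines in a consensus sequence context can be sketched as follows, assuming the commonly described sumoylation motif of a hydrophobic residue, lysine, any residue, and an acidic residue (D or E). The set of residues treated as hydrophobic here is a simplifying assumption, the function name is hypothetical, and the example sequence is an arbitrary toy peptide, not the SARS-CoV N protein sequence.

```python
import re

# Assumed consensus: hydrophobic residue, K, any residue, D or E.
# The hydrophobic set (V, I, L, M, F) is a simplification chosen for illustration.
# A lookahead is used so that overlapping motifs are all reported.
CONSENSUS = re.compile(r"(?=([VILMF]K[A-Z][DE]))")


def find_consensus_lysines(sequence):
    """Return 1-based positions of lysines that sit in the assumed consensus motif."""
    sequence = sequence.upper()
    # match.start() is the 0-based index of the hydrophobic residue;
    # the lysine follows it, so its 1-based position is start + 2.
    return [match.start() + 2 for match in CONSENSUS.finditer(sequence)]


# Arbitrary toy peptide (illustration only).
toy_sequence = "MAVKQEGGLKTDKAAE"
print(find_consensus_lysines(toy_sequence))
```

A lysine absent from the returned list may still be modified, which is why sites outside the consensus context are difficult to define by sequence inspection alone.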

SARS-CoV N protein has been found to be translocated to 
the nucleolus of host cells, resembling other coronavirus N 
proteins, such as IBV [10], MHV and TGEV [31]. This might 
be a common strategy for coronaviruses to control both host 
and viral RNA translation [10]. However, the functional signif- 
icance and the mechanism by which the N protein translocates 
to the nucleolus are yet to be determined. Systematic investiga- 
tion of NuLS of the SARS-CoV N protein that might target it 
to the nucleolus will be carried out based on sequence comparison 
with other coronaviruses. An alternative possibility that 
may account for the nucleolar localization of a viral protein 
is its interaction with certain nucleolar antigens such as fibrillarin 
and nucleolin. This was demonstrated for the IBV and MHV 
N proteins [4]. Sumoylation of certain host proteins can regulate 
their nucleo-cytoplasmic shuttling [22,32]. The K62A mutant 
exhibits much less nucleolar localization, indicating that 
sumoylation may regulate the subcellular localization of the 
SARS-CoV N protein. 